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Abstract Following previous theoretical work by Srinivasan (FOCS 2001) and the first 
author (STACS 2006) and a first experimental evaluation on random instances (ALENEX 
2009), we investigate how the recently developed different approaches to generate random- 
ized roundings satisfying disjoint cardinality constraints behave when used in two classical 
algorithmic problems, namely low-congestion routing in networks and max-coverage prob- 
lems in hypergraphs. 

We generally find that all randomized rounding algorithms work well, much better than 
what is guaranteed by existing theoretical work. The derandomized versions produce again 
significantly better rounding errors, with running times still negligible compared to the one 
for solving the corresponding LP. It thus seems worth preferring them over the randomized 
variants. 
r/\ The data created in these experiments lets us propose and investigate the following new ideas. 

QFor the low-congestion routing problems, we suggest to solve a second LP, which yields the 
same congestion, but aims at producing a solution that is easier to round. Experiments show 
(X3 that this reduces the rounding errors considerably, both in combination with randomized and 

x ^ | derandomized rounding. 

For the max-coverage instances, we generally observe that the greedy heuristics also per- 

I forms very good. We develop a strengthened method of derandomized rounding, and a sim- 

^. pie greedy/rounding hybrid approach using greedy and LP-based rounding elements, and ob- 

f"\l serve that both these improvements yield again better solutions than both earlier approaches 

[~ — . on their own. 

(T") For an important special case of max-coverage, namely unit disk max-domination, we also 

(""*) develop a PTAS. Contrary to all other algorithms investigated, it performs not much better 

p. | in experiments than in theory. In consequence, unless extremely good solutions are to be ob- 

j^~s tained with huge computational resources, greedy, LP-based rounding or hybrid approaches 

/— ^ are preferable. 
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1 Introduction 



Randomized rounding is one of the core primitives in randomized algorithmics. In contrast to 
many deep theoretical results, only very little experimental knowledge exists, and almost no fine- 
tuning and other implementation advice exists. Such results became even more interesting, since 
in the last ten years two substantially different methods [4, 7, 8, 21] extending the classical ap- 
proach of Raghavan and Thompson [19, 20] were developed. 

The only experimental work on either classical randomized rounding or the new approaches 
seems to be [5]. It compares the different methods on randomly generated rounding problems. 
The purpose of this work is to extend these results to two less artificial problem classes, namely 
routing and covering problems. These problems are among the first ones for which randomized 
rounding has been proven (by theoretical means) to lead to good algorithms. 
Randomized Rounding: Given an arbitrary real number x, we say that (the random variable) 
y is a randomized rounding of x, if y equals [x\ + 1 with probability {x} := x — [x\ and |_a;J 
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otherwise. In simple words, the closer x is to the next larger integer, the higher the chance of 
being rounded up. 

Randomized rounding builds on the simple observation that this keeps the expectation un- 
changed, that is, E(y) = x. This naturally extends to linear expressions. If j/i, . . . ,y n are ran- 
domized roundings of Xi, . . . , x n and / : R™ — ► R is a linear function, then E(f(yi , . . . , y n )) = 
f(xi, . . . , x n ). If, in addition, the yi are independent, then Chernoff bounds allow strong quanti- 
tative statements showing that with high probability, f(yi , . . . , y n ) is not far from its expectation. 
These two key facts allow to use randomized rounding in connection with integer linear program- 
ming. Two examples of this are given in the following sections. 

The new aspect of the works [4, 7, 8, 21] is that they allow to generate randomized roundings 
that satisfy certain cardinality constraints with probability one. That is, for certain sets /, we can 
prescribe that ^2 ieI yt — X)ie/ x i> provided the right-hand side is integral. This can be done 
without giving in with the other properties — both methods generate randomized roundings that 
admit the same Chernoff bounds as independent randomized rounding. 

For reasons of space, we cannot describe these rounding algorithms here. However, since in 
this paper we are mainly comparing them experimentally, the reader may treat them as black-box, 
keeping in mind only that they generate randomized roundings that look independent, but satisfy 
cardinality constraints. 

We shall concentrate on disjoint cardinality constraints. This is the most common form of 
cardinality constraints. Also, the comparison of the methods is more interesting here, since all 
have the same time complexity. 

Our Results: The aim of this work is to find out how well the different rounding approaches are 
suited to solve classical problems that are often attacked with LP-based methods, but also to try 
to find fine-tunings and alternative approaches. 

As underlying problems we chose the classical low-congestion routing problem and the max- 
coverage problem. They are different in flavor since in the first, randomized rounding is used with 
a focus of exploiting Chernoff bounds in linear constraints and objective function. In the second, 
since the right-hand side of the inequalities is one, we cannot do so, but resort to accepting that a 
certain fraction of the vertices covered in the relaxation are not covered after rounding. 

All our results indicate that generally the derandomized algorithms yield superior results. The 
increase in running time over the randomized versions usually is still negligible compared to the 
complexity of solving the LPs involved. 

For the low-congestion routing problem, we regard routing requests placed randomly on a 
two-dimensional grid. We regard instances small enough to compute the optimum solutions via 
solving an integer linear program. We observe that randomized rounding with cardinality con- 
straints obtains reasonably good solutions. Surprisingly, unlike in previous experiments [5], we 
observe that the bit-wise randomized approach of [4] produces better results than the tree-based 
one of [21]. 

The gap to the optimum is roughly halved if we use a derandomization of randomized round- 
ing. Here, the derandomization of [21] obtained in [5] proved to be superior. 

In an attempt to fine-tune randomized rounding, we propose solving a second LP which gives 
the same congestion in the relaxation, but aims at making the solution easier to round. While 
this naturally does not give improved theoretical guarantees, it yields a good reduction of the 
rounding errors, in particular, in combination with the derandomization. This seems to be a fruitful 
approach whenever the additional cost of solving a second LP is admissible. 

Our analysis of maximum coverage shows that both randomized rounding and the greedy 
algorithm produce good results in general. However, for both there are instances showing the 
other behave much better. Analyzing the data produced in our experiments, we consider two 
paths to a hybrid approach. One way is to strengthen the derandomization to include a greedy 
component in variable selection, as a gradient-based rounding; the other, complementary, is to 
spend part of the budget greedily, and solve the remaining instance via an LP- and randomized 



rounding-approach. Both hybrids perform better than either of the two plain approaches alone; 
the gradient-based rounding performs particularly well. 

For a natural planar Euclidian version of the problem, we also give a PTAS. However, unlike 
for all other approaches used in this paper, the experimental results are not much better than the 
theoretical guarantees. In consequence, this is an alternative useful only if very good approxima- 
tions are needed and if computation power is available plentiful. 

2 Randomized Rounding for Low-Congestion Routing 

2.1 The Low-Congestion Routing Problem in Networks 

The low-congestion routing problem is one of the classical applications of randomized round- 
ing [20]. In its simplest version, the objective is to route a number of requests through a given 
network, minimizing the maximum usage of an edge ("congestion"). Problems of this type found 
all kinds of applications, an early one being routing wires in gate arrays [10]. 

We shall regard the following basic variant, previously regarded also in [8, 12, 19-21]. 
Given is a (directed) network G = (V,E), together with k routing requests. Each consists 
of a source vertex Sj, a target vertex ti and a demand r"j. The objective is to find, for each 
i G [k] := {1, . . . , k}, a flow from Si to ti having flow value r, e N, such that the congestion, 
that is, the maximum total flow over an edge, is minimized. This problem is easily formulated as 
integer linear program (ILP): We minimize the congestion C subject to the constraints 

k 

VeeE:^2x le <C (1) 

i=i 

\fi € [k] : J^ x ie - ^2 x ie — n (2) 

e=(«i,t))eE e=(v,Si)£E 

Vi e [k}Vv€ V\{si,ti} : J^ x ie = ^ x ie 0) 

e—(w,v)^E e—(v,w)^E 

Vie [fc]Vee£:,i !e e{0,l}. (4) 

We should add that [20] only regard the special case of all demands r$ being one, since ran- 
domized rounding respecting cardinality constraints with right-hand side greater than one was 
not available at that time. For an application with particular need for larger r%, see the failure 
restoration problem in optical networks described in [8]. Also, we should add that other authors 
in addition have edge capacities c e and then minimize the relative congestion, but it is easily seen 
that this just replaces the C in the first type of constraints by c e C. 

Since already the case of unit demands is NP-complete, optimal solutions seem difficult to 
obtain. The common solution concept is to (i) solve the linear relaxation of the ILP and obtain a 
fractionally optimal solution (x*, C*)\ (ii) use path stripping to decompose each flow /j encoded 
in x* into a weighted sum /j = ^2 PeV . y* P fp, where Vi is a finite set of Si-ti paths, for each 
P E Vi, ,fp is the flow that has exactly one unit on each edge of P, and y* P <G [0, 1] — note that all 
this implies X)pgp V*iP ~ r «' Ciii) use randomized rounding to round all y* P to yip e {0, 1} in 
such a way that the cardinality constraints ^2 PeV . yip — Ti are maintained. Now ^2 Pe -p. yipfp 
is a flow from s, to ti with flow value r t . These flows form a solution having a congestion of 
C = maxegE ^2i-i ^2pev- eeP ViP- Large deviation bounds show that this congestion is not far 
from the value C* given by the relaxation [12, 20]: 

C = 0( . ^^ YifC*<logm; 
\log(21ogm/C*)/ 

C = C* +0(VC*logm), ifC* >logm. 



Recall that C* is a lower bound for the optimal solution. Hence if C* is not too small compared 
to m, then this approach gives very good approximation factors. 

2.2 Algorithms Used 

To approximately or exactly solve our test instances, we used the following algorithms. Whenever 
running times permitted, we used the exact ILP-Solver 1LOG CPLEX 1 1 .0 to directly solve the 
ILP given by (1) to (4). All other approaches involve solving the linear relaxation of the ILP (for 
which again we used CPLEX) and then different rounding methods. 

Since the ILP contains hard cardinality constraints, we cannot use the classical independent 
approach of Raghavan and Thompson (we did so, though, ignoring the cardinality constraints, 
to see if the cardinality constraints make rounding more difficult). There are two approaches to 
generate randomized roundings respecting cardinality constraints due to Srinivasan [21] and the 
first author [4]. Both can be derandomized [4, 5], so that in total we have four rounding methods 
available. See the original papers or [5] for a more detailed discussion of these methods. All 
algorithms different from CPLEX were implemented in C/C++. 

2.3 Experimental Set-up 

To analyze the questions discussed in the introduction, we regarded the following type of in- 
stances. Motivated by the fact that many routing problems have a two-dimensional flavor (e.g., 
the wire routing problem of [10]), we chose a finite two-dimensional bi-directed grid. Note that 
this simple graph is far from trivial for routing, see, e.g., the thrilling one-turn routing conjecture 
in [10], which is, to the best of our knowledge still open. 

We choose routing requests randomly as follows. Both Si and ti are chosen uniformly at 
random from V . To reduce otherwise the influence of randomness, we choose all demands in our 
main set of experiments as r^ = 3; in a second set of experiments we pick each r^ uniformly at 
random from {1, . . . , 5}. We also tried placing the Si, ti uniformly at random on the outer border 
of the grid, but saw no significant differences. 

The size n of the grid and the number of demands k was varied to create different instance 
sizes and densities. All numerical values reported are the averages over at least 100 runs. The 
times were measured on AMD dual processor 2.4 GHz Opteron machines. 

2.4 Analysis 

A summary of our results is presented in Tables 1 and 2. For the grid sizes 5x5, 10x10 and 15x15 
together with demands ranging from 10 to 75, we state the average values of the congestion of an 
optimal solution (line 1 of each table) and the amounts by which a solution computed by one of 
the four randomized rounding approaches (lines 2, 3, 5, and 6) is worse (in percent). Lines 4 and 
7 of the tables refer to an improvement discussed in the subsequent subsection. 

Particularly for instances that do show some congestion, we see that randomized rounding 
yields quite good solutions, much better than what the theoretical bounds would predict. Deran- 
domization is clearly worth the small extra effort (cf. the running times in Table 3), reducing the 
gap to the optimum by roughly a half. The results with randomly chosen demands in Table 2 are 
qualitatively similar to those with uniform demands in Table 1, with perhaps a generally some- 
what larger advantage for the derandomized algorithms. 

Comparing the different methods, surprisingly, our experiments generally show that the bit- 
wise randomized rounding approach of [4] (line 3) produced slightly better rounding errors than 
the tree-based one of [21] (line 2). We do not understand this phenomenon currently. Among the 
derandomizations, as expected and similarly as for random instances [5], the derandomization of 
the tree-based approach of [21] given in [5] is superior to the derandomization of the bit-wise one 
in [4]. This is due to the iterative nature of the latter, see again [5]. 



5x5 


10 


25 


50 


Optimum 


3.37 


6.21 


10.98 


RR [21] 


+9.23% 


+8.70% 


+7.29% 


RR[4] 


+7.13% 


+5.96% 


+5.01% 


RR+ 


+6.35% 


+4.99% 


+4.28% 


DeRR [4] 


+3.77% 


+3.54% 


+2.37% 


DeRR [5] 


+2.76% 


+1.93% 


+0.82% 


DeRR+ 


+1.19% 


+1.13% 


+0.64% 



15 x 15 
Optimum 
RR [21] 
RR[4] 
RR+ 

DeRR [4] 
DeRR [5] 
DeRR+ 



75_ 

14.76 
+5.96% 
+4.13% 
+2.64% 
+1.90% 
+0.88% 
+0.47% 

10 

+42.29% 
+40.31% 
+40.26% 
+23.42% 
+13.76% 
+16.59% 



10 x 10 


10 


25 


50 


Optimum 


2.19 


3.41 


5.59 


RR [21] 


+32.17% 


+39.88% 


+31.13% 


RR[4] 


+30.43% 


+36.07% 


+27.01% 


RR+ 


+31.85% 


+24.63% 


+18.60% 


DeRR [4] 


+22.33% 


+25.81% 


+15.56% 


DeRR [5] 


+16.38% 


+22.58% 


+11.63% 


DeRR+ 


+11.81% 


+10.85% 


+7.87% 



75_ 

7/76 
+24.48% 
+19.07% 
+12.50% 
+10.95% 
+8.38% 
+4.25% 



25 
2J3 
+58.97% 
+59.34% 
+45.79% 
+38.10% 
+27.84% 
+ 17.22% 



50 
431 
+52.07% 
+46.94% 
+36.98% 
+31.63% 
+21.90% 
+18.49% 



75 
531 
+45.70% 
+41.40% 
+28.49% 
+25.81% 
+18.28% 
+12.90% 



Table 1: Congestions achieved by the 7 different approaches for grid sizes 5 x 5, 10 x 10 and 15 x 15. All 
demands are chosen as r, = 3. The optimum was computed by solving the IP via CPLEX (not feasible for 
larger instances). For the other algorithms we state the relative increase of the congestion over the optimum. 



We also used classical independent randomized rounding. Clearly, this does not produce feasi- 
ble solutions in most cases. However, even ignoring this issue, we also observed that we typically 
have slightly larger congestions (e.g. in the sparse instance of 10 demands in a 15 x 15 grid, 
independent rounding lead to a congestion of 3.19 compared to congestions of 2.80 and 2.85 for 
the randomized approaches of [4] and [21]). 

Running times consumed by the different stages are mainly given in Table 3. All randomized 
rounding stages for each instance took less than 0.02 seconds. Less than a tenth of this is the time 
needed for the path-stripping in each instance. Hence these numbers are not given in the table. 
From the table, we see that the bit-wise derandomization takes about 20 times longer than the 
tree-based one, but both numbers are greatly dominated by the times for solving the LP (ignore 
the "Heur." line for the moment). 



2.5 A Heuristic Making Life Easier for Randomized Rounding 

As can be seen from the results presented so far, the different randomized rounding approaches 
usually find solutions that are not far from the optimum. We now propose and analyze a heuristic 
way to improve the performance. 

The rough idea is simple. Having solved the linear relaxation of the ILP, we know the optimal 
(relaxed) congestion C* that can be achieved. The congestion we end up with stems from this C* 
plus possible rounding errors inflicted in the congestion constraints (1). It is clear that randomized 
rounding has a higher change to increase the congestion if there are many congestion constraints 
satisfied with equality in the relaxation. 

Therefore, the heuristic we suggest is to resolve the LP with the following modifications. Let 
S <G [0, C*] be a parameter open for fine-tuning. We replace the congestion constraints (1) by 
Ve e E : X)i=i x ie — C* — 5 + z e , where C* is the (fixed) optimal congestion obtained from the 
first LP and z e £ [0, S] are new variables. The new objective is to minimize ^2 eeE z e . Since the z e 
are at most 6, the flow given by a solution of this new LP also yields a congestion of at most C* . 
However, the new objective punishes edges with total flow exceeding C* — 5. In consequence, 
the solution we obtain is also a solution for the original LP, but one that in addition tries to keep 
some room in the congestion constraints. 



5x5 


10 


25 


50 


75 


Optimum 


3.79 


6.65 


11.08 


15.08 


RR [21] 


+12.66% 


+9.47% 


+8.03% 


+6.03% 


RR[4] 


+8.71% 


+6.92% 


+5.23% 


+4.51% 


RR+ 


+6.60% 


+4.36% 


+3.34% 


+2.98% 


DeRR [4] 


+2.64% 


+2.86% 


+2.53% 


+1.92% 


DeRR [5] 


+1.58% 


+1.05% 


+0.99% 


+0.93% 


DeRR+ 


+0.79% 


+0.45% 


+0.63% 


+0.27% 



10 x 10 


10 


25 


50 


75 


Optimum 


2.60 


3.70 


5.86 


7.94 


RR [21] 


+31.15% 


+39.73% 


+28.16% 


+23.93% 


RRL4] 


+27.69% 


+32.70% 


+22.70% 


+18.64% 


RR+ 


+18.08% 


+24.05% 


+ 15.19% 


+14.11% 


DeRR [4] 


+16.92% 


+21.62% 


+ 14.51% 


+11.34% 


DeRR [5] 


+8.85% 


+16.22% 


+10.58% 


+8.56% 


DeRR+ 


+6.15% 


+9.46% 


+6.14% 


+4.16% 



15 x 15 


10 


25 


50 


75 


Optimum 


2.19 


2.99 


4.29 


5.71 


RR [21] 


+47.49% 


+55.52% 


+48.72% 


+40.28% 


RR[4] 


+43.38% 


+47.49% 


+41.26% 


+36.43% 


RR+ 


+39.73% 


+36.12% 


+36.13% 


+26.09% 


DeRR [4] 


+26.48% 


+31.22% 


+27.27% 


+20.49% 


DeRR [5] 


+19.18% 


+22.41% 


+22.14% 


+16.64% 


DeRR+ 


+17.81% 


+15.05% 


+16.78% 


+11.03% 



Table 2: Congestions achieved as in Table 1, but with demands chosen independantly and uniformly at 
random from r; G {1, . . . , 5}. 



5x5 


10 


25 


50 


75 


IP (CPLEX) 


0.0270 


0.1076 


0.343 


0.776 


LP (CPLEX) 


0.0227 


0.1078 


0.301 


0.697 


Heur. 


0.0129 


0.0380 


0.065 


0.096 


DeRR [5] 


0.0009 


0.0006 


0.0015 


0.0028 


DeRR [4] 


0.0126 


0.0270 


0.0538 


0.0589 



10 x 10 


10 


25 


50 


75 


IP (CPLEX) 


0.3678 


6.78 


43.70 


61.52 


LP (CPLEX) 


0.2349 


4.71 


32.03 


34.02 


Heur. 


0.6123 


5.74 


28.88 


51.31 


DeRR [5] 


0.0061 


0.012 


0.018 


0.018 


DeRR [4] 


0.1062 


0.257 


0.420 


0.459 



15 x 15 


10 


25 


50 


75 


IP (CPLEX) 


5.72 


277.7 


2057 


7606 


LP (CPLEX) 


1.61 


48.7 


567 


1135 


Heur. 


9.42 


131.6 


819 


2755 


DeRR [5] 


0.026 


0.044 


0.063 


0.07 


DeRR [4] 


0.375 


0.973 


1.371 


1.67 



Table 3: Running times of the 5 algorithms in seconds. Given is the time for this particular step. For example, 
the running time of what is called "DeRR+" in Table 1 is the sum of the values in lines "LP", "DeRR" and 
"Heur". 



The experimental results are again presented in Table 1 . Line 4 contains the results obtained 
by using randomized rounding as in [4] after applying the heuristic and line 7 does so with the 
derandomization of [5, 21]. We did the same experiments with the other two rounding algorithms. 
Since the results were mainly inferior (to a similar extent as without the improvement), we omitted 
these numbers in the table. In all experiments, we chose 8 = 1. 

The results clearly show that using this heuristic can be worth the extra effort of solving a 
second LP. Apart from two instances with very small objective values 1.98 and 2.19, the heuristic 
always gains us a significant improvement. Surprisingly, these gains tend to be higher when using 
the derandomized rounding algorithm. 

It should be noted, though, that solving the second LP can be costly, as the numbers in Table 3 
indicate. 



3 Maximum Coverage: From Greedy and Rounding to Hybrid 
Approaches 

Another problem where dependent rounding has found application is the Maximum Coverage 
problem. In this problem, the input is a set {Si, . . . , S n } of sets and a budget bound L. The task 
is to select a set of L sets to maximize the size of their union. Additionally, there can be costs c, 
associated with the sets, and weights or profits u>i associated with the elements. In this case the 
task is to maximize the weighted sum of the covered elements, subject to the constraint that the 
total cost of the sets is at most L. 

For the unit-cost case (when all set costs are equal to one), a (1 — 1/e) -approximation can 
be produced easily, either through a greedy algorithm [3] or via randomized rounding [1, 21]; 
see below for details. This is also the best polynomial-time approximation ratio possible unless 
P=NP [6]. For the general case (with weighted budgets), essentially the same ratio is possible 
using both approaches, but some care has to be taken to handle sets of high cost [11,21]. 

In this section, we report on experiments performed on max-coverage instances of various 
types and from different sources, comparing the behavior of the greedy and rounding-based al- 
gorithms. In addition, based on experiences from these experiments, we describe two forms of 
greedy/LP-rounding hybrid algorithms, and observe significantly improved solution qualities. We 
also consider a budget-preserving rounding, improving on that used in [21] (a similar improve- 
ment can be found in the so-called weighted dependent rounding of [7]). The algorithms, and our 
improvements to them, are described in Section 3.1. 

One important special case we consider is max-domination in unit disk graphs, correspond- 
ing to covering points in the plane under Euclidean distance. For this problem, we develop a 
PTAS (polynomial-time approximation scheme) as a further point of comparison. The PTAS and 
more information on the problem setting are given in Section 3.2. Some of our instances for this 
setting come from a real -world facility location problem [15], referenced by the OR Library web 
page [17]; as a source of more benchmark instances, we also convert facility location benchmarks, 
gathered at [22], to a max-coverage setting. See Section 3.3 for more information on our experi- 
mental setup. Thereafter, the rest of the section contains experimental outcomes and conclusions. 



3.1 The Algorithms 

The greedy algorithm repeatedly selects a set fitting in the budget that maximizes the ratio of the 
profit of the newly covered elements to the cost of the set. In the unit-cost case, this is a (1 — 1/e)- 
approximation [3]; with general costs, further modifications are needed, but for the case that 
the budget is large compared to the cost of the most expensive set, as will be the case in our 
experiments, it is still a good approximation [11]. 



For the rounding-based approach, we phrase the problem as an ILP as follows (let n be the 
number of sets, and m the number of elements in the instance). 

in 

max N^ WiXi (5) 

n 

S.t. ^ C iVi ^ L ( 6 ) 

i=\ 

Mi G [m] : x t < ^ yj (7) 

Mi g [m] : x % g [0, 1] (8) 

Mi G [n] : y, G {0, 1} (9) 

Let us first focus exclusively on the unit-cost case. Getting a randomized approximation for this 
case is simple. Solve the linear relaxation of the above formulation, and let (x* , y* ) be an optimal 
solution, of value W* . Applying randomized rounding to y* with a cardinality constraint preserv- 
ing the sum J^i Vt now on expectation produces a (1 — l/e)-approximation, due to the negative 
correlation properties of the rounding [21]. 

The de randomization works via the method of conditional expectation. Consider the expected 
outcome of independently rounding the variables y* above: 

m 

F(y*) = J2 w ^- IT ( l -y*/))- (!«) 

i—l j:i£Sj 

As shown by Ageev and Sviridenko [1], we can use F(y) directly as a guide for the derandom- 
ization, and produce a rounding y G {0,1}™ of y* , such that F(y) > F(y*). As F(y*) > 
(1 — l/e)W*, the rounding y will be a (1 — l/e)-approximation. 

As an additional alternative, we introduce gradient-based rounding. Recall that cardinality- 
preserving randomized rounding works by repeatedly considering pairs of non-integral variables 
and readjusting their values, maintaining the sum, such that one of them becomes integral (see 
e.g. [5]). By gradient-based rounding, we attempt to identify the best pair of variables to select 
for adjustment in each step. To truly find this pair would require 0(n 2 ) comparisons, each with 
cost 0(m), but we can approximate the selection by considering the gradient of F(y). It is easy 
to show that if yi and yj are non-integral, and if q V ' > d , then moving mass from yj to yi 
will keep the value of F(y) non-decreasing. Thus we only need to compute and update the partial 
derivatives a , which can be done analytically at a cost of 0(nm) per step, and we can in 
every step pair off the variables with largest and smallest values of partial derivative. While the 
total complexity of the rounding process becomes 0(n 2 m), as opposed to 0(nm) for standard 
derandomized rounding, the time requirement is small in practice (see Section 3.4). 

Returning to the issue of weighted (knapsack) budget constraints, Srinivasan gives a rounding 
procedure (Lemma 3.1 in [21]) that approximately preserves the value of a weighted sum of the 
rounded variables, while guaranteeing negative correlation properties as in the unit-cost case. 
However, the way in this is achieved for many settings causes infeasible running times. To solve 
the problem, we consider a budget-preserving rounding procedure, as follows. A similar rounding 
is found in the weighted dependent rounding used in [7] . 

Theorem 1. Let y £ [0, 1]™ such that '^2 i ciyi — L. In polynomial time, one can compute a 
rounding y G {0, 1}™ such that 'J2 i ayi < L + max^ a and F(y) > F(y). 

Proof. We refer to e.g. [5] or [1] for a description of the non-weighted, cardinality-preserving 
rounding procedure. The only modification to this procedure is to replace the pair-rounding step, 



where variables yi and yj are adjusted so that one of them becomes integral. Here, instead of 
keeping yi+yj constant, we instead maintain Cjj/j+Cjj/j, that is, we change j/j and yj to g/j — (#/cj) 
and yj + (6/cj) for some appropriate S. We will show that among any pair of adjustments in 
different directions, at least one keeps F(y) non-decreasing. 

Let F(y) be defined as (10), and let two non-integral variables yi and yj be selected, i < j, 
and define 

g a (S) =F(j/i,...,j/ i _i,j/ i + (5,...,j/ j -aS,...,y n ). 

We show that for any a > 0, g(S) is a convex function. Indeed, the only terms of g(5) that are not 
constant or linear in S correspond to elements x G Si<~) Sj: 

w x (l - (1 - yi - S)(l -yj +aS) J| (1 - y k ) 

k:x&Sk,k^{i,j} 

where the only non-linear term is c • <5 2 , for some constant c > 0. 

Thus one of the values g{— 5q) and g(5\), for 8i > are at least as large as g(0) = F(y), and 
the rounding search as in [1] can be performed. 

Finally, the max^ a term occurs because we may end up with one single non-integral variable 
at the end of the process. □ 

As the first paragraph of this proof shows, to combine the gradient-based rounding with this 
rounding process we only have to divide each component i of the gradient by the corresponding 
cost c, before variable selection. 



3.2 A Unit Disk Maximum Domination PTAS 

As a source of real-world instances, we consider a type of max-coverage instances derived from 
planar point set data. Given a set of points P = {p\ , . . . , p n } in the plane and a diameter d, we 
define a graph (the unit disk intersection graph) by letting two points p i7 Pj be connected if and 
only if the Euclidean distance between them is at most d. In this graph, we consider the problem 
of max-domination, where selecting a vertex v covers v and all its neighbors. Interpreted as max- 
coverage, we thus get an instance where every vertex corresponds to one set and one element. All 
sets will have unit cost; the elements may have weights, interpreted as the profit of covering them. 

This problem is NP-hard, as follows from the hardness of Minimum Dominating Set in unit 
disk graphs [16]. However, it has good approximation properties — using tools known from the 
literature (e.g. [9]), we are able to provide a polynomial-time approximation scheme (PTAS). 

The PTAS follows the grid-based shifting strategy of Arora [2], as also applied to a related 
problem on the placement of wireless base stations by GlaBer et al. [9]. We show the result for 
instances with unit costs and arbitrary profits, as this is closest to our instances, and having both 
costs and profits arbitrary makes the problem as hard as max-coverage. 

Theorem 2. For any £ > 1, the Max Domination problem on unit disk graphs with weighted 
vertices and unit costs admits a (1 — 2/ ^-approximation in time n *• '. 

Proof. Assume w.l.o.g. that the diameters of the disks is one, and divide the space into a regular 
grid with unit sides, so that every point resides in one grid box. Repeat the following for all 
values Ihilv € {0, ...,£- 1}. 

Mark every ^:th column, starting from number lh, and every ^:th row, starting from number t v . 
Any point which is a member of a marked row or column gets payoff value 0. Now for every 
subgrid of side / + 1, framed by marked rows and columns, enumerate all optimal solutions as 
functions from the budget used to the payoff achieved. 

Since our concern is the unit-cost case, the number of such payoffs is bounded by the cardinal- 
ity of the solution which dominates all points, i.e., by the size of a minimum dominating set. This 
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in turn is bounded by the independence number, since a maximal independent set is dominating, 
which is 0{l 2 ), since each disk covers an area of 0(1) and the subgrid has total area 0(l 2 ). 

Thus, we can in polynomial time create a vector of the 0(l 2 ) different payoff values that are 
received by investing a certain number t of sets in the current subgrid; for each such value t, the 
optimal solution can be found in n t+ °^ time by enumeration. Thus in n 0{ - 1 ' time we can get 
the budget:payoff vectors for every framed subgrid. 

Thereafter, we can collect these vectors into a kind of knapsack problem, which can be solved 
by dynamic programming over subgrids and budget, i.e., f(i, t) being the best possible payoff 
for using t of the budget to cover vertices in the first i subgrids. Note that unmarked points in 
different framed subgrids must be covered by different points, and vice versa, an unmarked point 
can only be covered by a point in its framed subgrid. Thus a solution to the instance, after the 
marking step has been performed, decomposes into one solution per framed subgrid, and we are 
able to compute this optimally in polynomial time. 

Finally, every point is marked at exactly one value of £h, thus for some shift value lh at 
most a fraction l/£ of the points are marked; by the same argument, for some value of £ v , at 
most a fraction l/£ of the remaining points are marked. This means that there is a pair of shift 
values £h,£ v for which the computed solution represents at least a fraction (1 — l/£) 2 > (1 — 2/^) 
of the optimal solution. □ 



In our implementation, we replace certain steps by an M1P solver, specifically the exhaustive 
enumeration phase for the subproblems, and the dynamic programming knapsack step. With these 
modifications, execution of the algorithm for non-trivial approximation ratios becomes possible. 
Unfortunately, we will see that even with these modifications, the PTAS approach is inferior to 
the greedy and rounding algorithms for realistic approximation settings. 



3.3 Experimental Setup 

We perform experiments using the greedy algorithm and different combinations of LP-relaxation 
and rounding procedure. 

For the rounding, we use three variants. Recall that the expected value of a single randomized 
rounding equals F(y*), and can thus (unlike in Section 2) be computed exactly. We consider 
three ways of boosting this value. The first is to simply apply randomized rounding 1000 times 
and pick the best result; the second is derandomization, using Srinivasan-type rounding directly 
on F(y), with arbitrary order of variable comparison. The third is the gradient-based rounding of 
Section 3.1. 

For problems with a weighted budget, we use the same three rounding methods, where the 
best-of-1000 and the derandomized roundings use Lemma 3.1 of [21], and the gradient-based 
rounding uses Theorem 1 . Should a rounding exceed the budget constraint, we will greedily dis- 
card sets until the budget bound is reached. 

We use CPLEX for both LP- and ILP-solving; all other algorithms were implemented by the 
authors in C. 

Our instances are of two types. For the unit disk max-domination problem, we use bench- 
mark instances stemming from a real-world facility location problem, previously used in [18] 
and available at [15]. For these instances, a demand is provided with every point; we use these 
demands as profit values. To complement this, we use instances converted from facility location 
problems, in most cases downloaded from UflLib [22]. In both cases, we convert the instances 
to max-coverage by selecting an appropriate distance threshold for membership. In the case of 
the M* instances [14], the distances were pre-scaled by demand values, making the distances 
inappropriate for our use; we remove this scaling, and use the demands as profit values instead. 
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3.4 Experiments 

We now report the results of our experiments. 

To begin with, we show the running times of the different methods on various instances in 
Table 4. Note that the LP solver once again uses a significant fraction of our running time, and 
that performing many random roundings becomes more costly than derandomization, due to the 
need to evaluate the objective value for each solution. The low numbers for the gradient-based 
rounding, as compared to the derandomization, can partly be explained by the gradient-based 
rounding being problem-specific, while the derandomization uses general-purpose code. 

Table 5 shows results for some individual instances (described below). The first we want to 
highlight are the Chessboard and Finite Projective Plane (FPP) instances. These classes, proposed 
in [13], downloaded from [22], are not intended as examples of realistic real-world problems, 
but rather serve to reveal the differences between the different approximation approaches. The 
Chessboard instances are instances on a chessboard with side 3fc, where the elements are the 
squares, and the sets are 3 x 3 subgrids (the set of squares reachable by a king within one step). 
For us, these instances serve as a test for whether an algorithm discovers an optimal tiling. The 
FPP instances are graphs of a regular degree but more complicated structure. Since all instances 
of these classes have equivalent combinatorial structure, we use only one instance per class in our 
experiments. In both cases, the budget is set at the most difficult setting, which turns out to be just 
where the LP-relaxation can cover all or almost all elements. 

The results immediately show the reason to pursue hybrid greedy/rounding algorithms. For 
the Chessboard instance, the LP-optimum is already integral, and thus every LP-rounding-based 
algorithm discovers an optimal solution. On the other hand, these instances are difficult for the 
greedy algorithm, as a few early mistakes, when all sets seem equivalent, will hurt the end tiling. 
In the FPP instances, however, we see the opposite effect. Here, upon inspection, we find that 
the LP-optimum is a useless mix of taking an equal, small amount of almost every variable, 
leaving all the work of finding a good integral solution up to the rounding. On the other hand, the 
greedy algorithm performs very well here; runs with an ILP-solver show that it is at most one step 
away from the optimum. We find that our proposed hybrid, the gradient-based rounding, produces 
consistent top-quality results in both test cases. 

We now focus on the unit disk max-domination problem, with instances as described in Sec- 
tion 3.2. We select the largest instance, with 818 points inscribed in a box of sides 6395 by 3975, 
and use a distance threshold of 400. This was chosen as a good balance, as too small or too large 
values (e.g., 100 resp. 800) creates too simple instances. Figure 1 shows the behavior of the main 
algorithms (excluding the PTAS) for this instance, as depending on the budget. Observe that the 
LP-rounding approach is very powerful for small budgets (up to 20), while further guidance is 
needed for larger budgets. The gradient-based LP-rounding, providing just such guidance, pro- 
duces top values throughout, frequently better than either the greedy or the standard rounding 
algorithms. This instance also appears in Tables 5 and 4 under the name br8 18-400 or br818- 
400-L, where L is the budget bound. Another instance class of the same type is the k-median 
instances [22]. Here we use the one named 1000-10, with a threshold of 1000, occurring in Ta- 
bles 5 and 4 as kmedl-lk or kmedl-lk-L, with Figure 2 displaying the same data as Figure 1. 



Instance 



br8 18-400-30 
kmedl-lk-37 
MR1-060-16.5 



818 

1000 

500 



LP Random- 1000 Derand. Grad. rounding Greedy 



0.36s 
0.97s 
0.36s 



0.80s 
1.46s 
0.74s 



0.11s 
0.22s 
0.05s 



0.09s 
0.25s 
0.04s 



0.33s 
0.90s 
0.14s 



IP 



25s 

> 3600s 

> 3600s 



Table 4: Running times for various instances and algorithms; the times for the rounding methods exclude 
the time for solving the relaxation. 
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Name 


Size 


Budget 


Greedy 


Chessboard 


144 


16 


130 


FPP (fc = 17) 


307 


17 


290 


br8 18-400 


818 


30 


28054 


kmedl-lk 


1000 


37 


948 


MR1-060 


500 


16500 


1444 



LP 



': once 1000 derand 


gradient 


Optimum 


144 144 144 


144 


144 


200 210 230 


290 


290 


22157 26199 27397 


28448 


28709 


709 817 923 


962 


993-95 


1179 1254 1402 


1445 


1462-94 



Table 5: Experiments on single instances. The data is averaged over 100 runs where the input data is ran- 
domly permuted. The column "LP: once" shows the expected outcome of a single randomized rounding; 
the following three columns show our three rounding methods. The optimum gives the best upper and lower 
bound achieved by an IP solver after one hour of running time. 



In addition, Figure 2 contains a plot of the fraction of the solution weight covered by integrally 
chosen sets (the line "fixed" in the figure). The figure confirms that the LP-rounding approach is 
powerful for small budgets. 

For concerns of clutter, the PTAS is not included in the figures, but its data is given separately 
in Table 6. Note that for every feasible setting, the PTAS is both of lower quality and significantly 
slower than the alternatives. The k-median-instance is omitted, since we lack point data for it. 

Finally, we also consider an instance class with weighted budgets, namely the M* instances 
proposed in [14], again downloaded from [22]. These are instances with uniformly random dis- 
tances (i.e., random membership after conversion to max-coverage), but with facility costs (i.e., 
set costs) chosen so that facilities close to many customers are more expensive. The authors of [14] 
propose that such cost structures would arise in some real-world situations. Table 5 and Figure 3 
give the results for instance MR1 with distance threshold 0.6, under the name MR1-060. In gen- 
eral for this class, we found that the greedy algorithm and the gradient-based rounding method 
produce practically identical results, while the other methods are inferior to this. 



3.5 A Greedy/LP Hybrid. 

Motivated by our results, we consider a different, more general form of greedy/LP hybrid than 
the gradient rounding. Before we commence with the LP-rounding, we allocate some portion of 
the budget to greedy pre-selection, and apply the LP-relaxation and rounding using the remain- 
ing budget to the thus reduced problem. In Tables 7-9 we examine the performance of such a 
hybrid on the br818-400-30, kmed-lk-37, resp. MR1-060-16.5 instances. We see that such a hy- 
brid algorithm with a carefully chosen threshold can produce results superior to either the greedy 
or the rounding algorithm on their own. The benefits for the gradient-rounding approach seem 
less consistent, although improvement is visible for the kmedl-lk instance. We further report that 
with a pre-selection fraction of 0.3, both the Chessboard and the FPP instances of Table 5 receive 
optimal solutions. 



Instance 

br8 18-400-20 
br8 18-400-25 
br8 18-400-30 



PTAS (£ = 3) PTAS (£ = 5) PTAS (I = 7) Greedy 



Value 
20742 
23008 
23951 



Time Value 
1.3s 22857 
1.2s 24683 
1.3s 24909 



Time 

9s 

9s 

10s 



Value 
24129 
25308 
27270 



Time 
69s 

71s 
74s 



Value 
25247 
26907 
28054 



IP 

Value Time 
26192 0.5s 
27670 9s 
28709 25s 



Table 6: PTAS performance compared to the greedy algorithm and IP solver. 



13 







Rounding 


Algorithm 






Relaxation 


Integral part 


Expectation 


Random- 1000 


Derand 


Gradient 


LP 


4646 


21871 




26251 


27352 


28447 


Hybrid 0.1 


12635 


24936 




27500 


27865 


28328 


Hybrid 0.2 


22921 


27189 




28365 


28141 


28476 


Hybrid 0.3 


28188 


28304 




28339 


28338 


28339 


Hybrid 0.4 


28215 


28215 




28215 


28215 


28215 


Hybrid 0.5 


27817 


26837 




28050 


28019 


28082 


Hybrid 0.6 


27970 


27530 




28048 


28048 


28058 


Hybrid 0.7 


28058 


28058 




28058 


28058 


28058 


Hybrid 0.8 


28054 


28054 




28054 


28054 


28054 


Hybrid 0.9 


28054 


28054 




28054 


28054 


28054 



Table 7: Results for combining greedy pre-selection with randomized rounding, averaged over 100 runs, 
complete results. The data is for the instance br8 18-400-30, where the greedy algorithm alone produces 
value 28054. 







Rounding Algorithm 






Relaxation 


Integral part 


Expectation 


Random- 1000 


Derand 


Gradient 


LP 


0.17 


700 


816 


923 


961 


Hybrid 0.1 


137 


755 


857 


940 


970 


Hybrid 0.2 


312 


817 


904 


957 


974 


Hybrid 0.3 


757 


925 


962 


966 


972 


Hybrid 0.4 


797 


934 


960 


959 


964 


Hybrid 0.5 


915 


947 


954 


954 


955 


Hybrid 0.6 


929 


948 


952 


952 


953 


Hybrid 0.7 


931 


947 


950 


951 


950 


Hybrid 0.8 


947 


948 


948 


948 


949 


Hybrid 0.9 


948 


948 


948 


948 


948 



Table 8: Results for combining greedy pre-selection with randomized rounding, averaged over 100 runs, 
complete results. The data is for the instance kmedl-lk-37, where the greedy algorithm alone produces 
value 948. 
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br81 8-400 




LP-relaxation 

LP+gradient 

LP+derandomization 

LP+random (1000) 

LP-expected 

Greedy algorithm 

Approximation bound 



30 35 

Budget 



50 



Figure 1: Results for unit disk max-coverage instance br818-400. The plot shows the value of the LP- 
relaxation, the outcome of the three rounding methods, and the expected value of a single rounding against 
the greedy algorithm. The approximation bound shows (1 — 1/e) times the LP optimum. 
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Figure 2: Experimental outcome for instance kmedl with distance threshold 1000. The plot shows the value 
of the LP-relaxation, the outcome of the three rounding methods, and the expected value of a single rounding 
against the greedy algorithm. The approximation bound shows (1 — 1/e) times the LP optimum. 







Rounding 


Algorithm 






Relaxation 


Integral part 


Expectation 


Random- 1000 


Derand 


Gradient 


LP 





1179 




1257 


1381 


1446 


Hybrid 0.1 


369 


1232 




1301 


1402 


1450 


Hybrid 0.2 


624 


1285 




1344 


1421 


1443 


Hybrid 0.3 


821 


1316 




1371 


1413 


1447 


Hybrid 0.4 


974 


1349 




1391 


1432 


1448 


Hybrid 0.5 


1109 


1381 




1413 


1436 


1450 


Hybrid 0.6 


1226 


1408 




1432 


1438 


1441 


Hybrid 0.7 


1355 


1431 




1443 


1440 


1440 


Hybrid 0.8 


1440 


1446 




1440 


1445 


1440 


Hybrid 0.9 


1443 


1443 




1443 


1442 


1443 



Table 9: Average value and standard deviation for all ways to relax-and-round one instance (MR1, distance 
0.6, budget 16500). The pure greedy algorithm gives 1444. LP bound is 1503, optimum between 1462 and 
1494. 
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Figure 3: Experimental outcome for instance MR1 with distance threshold 0.6. The plot shows the value of 
the LP-relaxation, the outcome of the three rounding methods, and the expected value of a single rounding 
against the greedy algorithm. The approximation bound shows (1 — 1/e) times the LP optimum. 
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